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The Use of Item-Favorability Data as Evidence of 
Sex Bias in Interest Inventories 



Qb.lectives of the Inquiry 

The purpose of this investigation was to study the appropriateness 
of using item-favorability data as evidence of sex bias in interest 
inventories. The large differences which are evident between the mean 
scores of males and females on certain interest scales may be the direct 
res'ilt of differences existing In the response styles of men and women 
or they may be indicative of existing sex bias (National Institute of 
Education, 197^+). The literature, however, suggests that the results 
of interest inventories merely reflect basic differences in the interests 
of men and women (Anastasi, 1958, I968; Noeth, Roth & Eredlger, 1975). I* is 
understandable, therefore, that a recommendation made by the National 
Institute of Education that the items within a given interest scale should 
be balanced by sex with respect to favorability has been the subject of 
considerable controversy (HIE, 197^+). It is hoped that the results of 
this investigation will add new insight into the issues related to use 
of item-favorability data as evidence of sex bias in interest inventories. 



Instrumentation 



The data in this investigation were obtained from the administration 
of two scales. Machine Work and Clerical Work , from the Ohio Vocational 
Interest Survey . The two scales were selected because the pattern of 
scores generally obtained by males and females reflect q.uite dramatically 
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two widely held sex- role stereotypes. The two OVIS interest scales contain 
eleven activity statements which were selected to represent typical job 
activities performed by workers in a specified group of occupations-^. 
A five-option Likert response format is used with each item. The 
response options and corresponding scoring weights are: dislike very 
much, 1; dislike, 2; indifferent, 3; like, and like very much, 5* 
The range of possible scores is 11 to 55. The higher the score the 
greater the person's preference for the job activities associated with 
the particular cluster of occupations.. As indicated above, the two 
sex groups perform quite differently on the two scales. For the research 
sample, the mean score for the males on Machine Work was 13.1 score 
points higher than the mean score for the females. For Clerical Work , 
the mean score for females was 9*2 score points higher than the mean 
score for males. 

Methodology 

Sajjrple: The subjects used in this study were participants in the 
national standardization of the Ohio Vocational Interest Survey (OVIS). 
The sample consisted of 10,225 eighth-grade students from 39 school 
systems throughout the country^. 

1 Each OVIS scale is based on one or more of the 22h homogeneous 
worker-trait groups defined by the U.S. Department of Labor in 
the Dictionary of Occupational Titles , Third Edition. 

2 A more detailed description of the sample may be found in Chapter 
h in the OVIS Manual for Interpreting . 
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Analyses 

Three different procedures were used to study the item performance 
data collected during the OVIS standardization program. The first 
analysis was an examination of it em-favor ability. The chi-square statistic' 
was used to test for significant dirrerences between the two sex groups. 
For eeih item, a fourfold contigency -.able (Sex X Favorability) was used 
to obtain the cell frequencies. For the purpose of the study, a "Uke very 
much" response and a "like" response were designated as favorable responses. 
The remaining three response categories were designated as unfavorable 
responses. 

The second analysis was an examination of the item data for evidence 
of significant item-sex group interactions. A plotting procedure suggested 
byEcternacht {I97h) was used for this analysis. The assumption underlying 
this procedure is that the presence of item-sex group interactions can be 
used as evidence of possible sex bias. 

The final analysis was an examination of the item-total score cor- 
relations for the two sex groups. Each student's score on a given item 
(the item-response weight) was correlated with the student's total score 
(the sum of the eleven item weights). This is a standard statistical 
procedure for evaluating item homogeneity and construct validity for 
psychological scales. Sex differences in the item-total score correlations 
would suggest that the items in a given scale are functioning in a biased 
manner with respect to the stated construct. The significance of the 
difference between the male and female correlation coefficients was 
tested usiag the Fisher fi transformation (MdTemar, 1962). 
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Results 

The item-favorability data are reported in Table 1. It should be 
noted that all twenty-two chl square values were significant at the 
.01 levBl. Thus, with respect to these two sets of items, males and 
females did not view the items with the same degree of favorability. 
Furthermore, all eleven activities in the Machine Work scale were found 
to be favored more by males than females. In the Clerical Work scale, 
the reverse pattern was found. All eleven activities in this scale 
were favored more by females than males. Thus , in terms of the NIE 
Guidelines for Assessment of Sex Bias and Se x Fairness in Career 
Interest Inventories (197U), the two scales could be considered 
potentially sex biased* 

Figures 1 and 2 provide a graphical presentation of the item-sex 
group interaction analysis for Machine Work and Cl erical Work 
respectively. The solid,, straight line in each graph represents 
a hypothetical normal distribution of the cummulative item-delta 
differences deltas reflect the proportion of 

favorable responses. The broken, curved lines represent the confidence 
bands for the .05 level of significance. The points, which are plotted 
on normal probability graph paper, represent the item-delta, differences 
for males and females for the respective items. As shown in the two 
graphs, all of the plotted points fell within the designated confidence 
bands. Thus, the departure from normality was not found to be statis- 
tically significant at the .05 level. This, suggests that the item-sex 
group interaction was not significant. Item-sex group Interaction could 

6 

o 

ERIC 



-5- 

be concluded only in the case where the condition of normality had been 
rejected. The absence of a significant item-sex group interaction, 
therefore, suggests an absence of sex bias in the performance of the 
tvo groups on the respective interest scales, assuming that a constant 
bias is not inherent In the items themselves. 

Tables 2 and 3 contain item-total score corrtlational data for 
a random sample of 1000 males and 1000 females dravn from the larger 
sample of 10,225 eighth-grade students. For the Machine Work scale, 
the range of the correlation coefficients was .6o to .76 and .56 to .77 
for the males and females respectively. The median values were .72 for 
the males and .73 for the females. The pattern in the Clerical Work 
scale was similar. The range of coefficients was .50 to .75 for the 
males and .50 to .82 for the females. The median values were .68 and 
.71 for males and females respectively. 

As shown in Table 2, the differences between the 11 pairs of male 
and female correlation coefficients were not statistically significant 
at the .05 level. Therefore, the eleven Machine Work items were found 
to contain the same statistical properties for both sex groups with 
respect to stated construct. In Table 3, however significs^nt dif- 
ferences were found for five of the eleven pairs of correlations. 
This suggests that these five items may be functioning differently for 
males and females with respect to the construct identified as 
Clerical Work , However, the median correlation coefficients showed no 
significant difference overall. Thus, further analysis appears to be 
necessary to determine if the small, but significant differences have 
practical implications. 
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Conclusions and Implications 

Althotigh tte item-favorability data suggested possible sex bias 
in the two Interest scales, consistent support for this position was 
not found in the other item data. The item-sex groiqp interaction* analysis 
failed to provide any evidence of existing item-sex group interaction. 
The item-total score correlation analysis did detect a possible source 
of sex bias in the Clerical Work scale but no evidence of bias was found 
in the Machine Work scale . These findings raise serious questions re- 
garding the ai>propriateness of using item favorability data as evidence 
of possible sex bias. While additicual investigations will be necessary 
to confirm these findings, the conclusion reached at this time is that 
item favorability data by themselves do not appear to be valid indicators 
of sex bias in interest inventories. 



These findliigs help to point out the need for a workable operational 
definition of sex bias as the teim applies to interest meajsurement. 
The existing definitions, while politically expedient, tend to be 
a curious mixture of emotional subjectivity and psychcmetric objectivity. 
As a result, the available guidelines lack the specificity which would 
YTiakft them uaiRful criteria for assessing sex bias in interest inventories. 
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Table 2. Compari'>on or the Item-Total Score Correlations for 
Scale 2, Machine Wcrk. 



■ 


Males 


Femal es 




Sign 

1 p\/p1 


Item NO. 






r 




7m-7f 


9 


.60 


.693 


.56 


.633 


.058 


N.S. 


23 


.61 


.709 


.60 


.693 


.016 


N.S. 


98 


.72 


.908 


.69 


.848 


.060 


N.S. 


109 


.74 


.973 


.75 


.950 


.023 


N.S. 


130 


.74 


.950 


.77 


1.020 


.070 


N.S. 


157 


.71 


.887 


. 74 


acn 


.063 


N.S. 


219 


.76 


.996 


.75 


.973 


.023 


N.S. 


239 


.71 


.887 


.73 


.929 


,042 


N.S. 


243 


.72 


.908 


.73 


.929 


.021 


N.S. 


256 


.76 


.996 


.74 


.950 


.046 


N.S. 


268 


.72 


.908 


.71 


.887 


.021 


N.S. 


Median 


.72 


.908 


.73 


.929 


.021 


N.S. 
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Table 3. Comparison of the Item-Total Score Correlations for 
Scale 5, Clerical Work. 





Males 


Females 






Item No. 


1 








Zm-Zf 


Sign 
Level 


1 


.50 


.549 


.50 


.549 


.000 




31 


.54 


.604 


.57 


.648 


.044 


N.S. 


45 


.57 


.648 


.62 


.725 


.077 




55 


.69 


.848 


.75 


.973 


.125 


.05 


100 


.68 


.829 


.75 


.973 


.144 


.05 


112 

lie* 


.75 


.973 


.82 


1.157 


.184 


.05 


133 


.73 


.929 


.78 


1.045 


.116 


.05 


164 


.72 


.908 


.71 


.887 


.021 


N.S. 


200 


.66 


.793 


.64 


.758 


.035 


N.S. 


202 


.64 


.758 


.64 


.758 


.000 


N.S. 


261 


.70 


.867 


.76 


.996 


.129 


.05 


Median 


.68 


.829 


.71 


.887 


.058 


N.S. 
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